Topology-Aware Communication in Wide-Area Message-Passing

نویسنده

  • Craig A. Lee
چکیده

This position paper examines the use of topology-aware communication services to support message-passing in wide-area, distributed environments, i.e., grids. Grid computing promises great benefits in the flexible sharing of resources but poses equally great challenges for highperformance computing, that is to say, how to execute large-scale computations on a grid with reasonable utilization of the machines involved. For wide-area computations using a message-passing paradigm, these issues can be addressed by using topology-aware communication, i.e., communication services that are aware of and can exploit the topology of the network connecting all relevant machines. Such services can include augmented communication semantics (e.g., filtering), collective operations, content-based and policy-based routing, and managing communication scope to manage feasibility. While such services can be implemented and deployed in a variety of ways, we propose the use of a peer-to-peer, middleware forwarding and routing layer. In a related application domain (time management in distributed simulations) we provide emulation results showing that such topology-awareness can play a major role in performance and scalability. Besides these benefits, such communication services raise a host of implementation and integration issues for their operational deployment and use in grid environments. Hence, we discuss the need for proper APIs and high-level models.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design and Implementation of Adaptive Message Passing Systems for Wide-Area Distributed Computing Environments

Recently, much research has gone into wide-area message passing systems, but more work is necessary so that message passing systems can adapt to wide-area environments by themselves and stop requiring manual configuration. Thus, in this paper, I make two proposals concerning the design and implementation of adaptive message passing systems for wide-area, distributed environments. My first propo...

متن کامل

Collective Operations for Wide-Area Message Passing Systems Using Dynamically Created Spanning Trees

We propose a configuration-free method to perform collective operations efficiently in dynamically changing topologies. Our collective operations are designed so that (1) they perform well when the topology is stable, (2) they complete successfully even when processors join or leave, and (3) they adapt to topology changes. We propose to create adaptive latency-aware spanning trees for short mes...

متن کامل

Dynamic Topology Selection for High Performance MPI in the Grid Environments

MPI (Message Passing Interface) is getting more popular and important even in the Grid, but its performance still remains a problem, which is caused by the communication bottleneck on wide area links. To overcome such performance wall problem, we propose a dynamic topology selection which is a kind of resource selection method. It provides an effective resource selection service based on four p...

متن کامل

Collective operations for wide-area message passing systems using adaptive spanning trees

We propose a method for wide-area message-passing systems to perform broadcasts and reductions efficiently using latency and bandwidth-aware spanning trees constructed at run-time. These trees are updated when processes join or leave a computation, allowing effective execution to continue. We have implemented our proposal on the Phoenix Message-Passing Library and performed experiments using 16...

متن کامل

An Efficient Group Communication Architecture over ATM Networks

NYNET (ATM wide-area network testbed in New York state) Communication System (NCS) is a multithreaded message-passing tool developed at Syracuse University that provides low-latency and high-throughput communication services over Asynchronous Transfer Mode (ATM)-based highperformance distributed computing (HPDC) environments. NCS provides exible and scalable group communication services based o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003